Abstract

Background: Peroxiredoxin 6 (PRDX6) is involved in redox regulation of the cell and is thought to be protective 
against oxidant injury. Little is known about genetic variation within the PRDX6 gene and its association with acute 
lung injury (ALI). In this study we sequenced the PRDX6 gene to uncover common variants, and tested association 
with ALI following major trauma. 

Methods: To examine the extent of variation in the PRDX6 gene, we performed direct sequencing of the 5' UTR, 
exons, introns and the 3' UTR in 25 African American cases and controls and 23 European American cases and 
controls (selected from a cohort study of major trauma), which uncovered 80 SNPs. In silico modeling was performed 
using Patrocles and Transcriptional Element Search System (TESS). Thirty seven novel and tagging SNPs were tested 
for association with ALI compared with ICU at-risk controls who did not develop ALI in a cohort study of 259 African 
American and 254 European American subjects that had been admitted to the ICU with major trauma. 

Results: Resequencing of critically ill subjects demonstrated 43 novel SNPs not previously reported. Coding regions 
demonstrated no detectable variation, indicating conservation of the protein. Block haplotype analyses reveal that 
recombination rates within the gene seem low in both Caucasians and African Americans. Several novel SNPs 
appeared to have the potential for functional consequence using in silico modeling. Chi-square analysis of ALI incidence
and genotype showed no significant association between the SNPs in this study and ALI. Haplotype analysis did 
not reveal any association beyond single SNP analyses. 

Conclusions: This study revealed novel SNPs within the PRDX6 gene and its 5' and 3' flanking regions via direct 
sequencing. There was no association found between these SNPs and ALI, possibly due to a small sample size,
which limited detection to relative risks of 1.93 and above. Future studies may focus on the role of PRDX6
genetic variation in other diseases where oxidative stress is suspected.
Background

Acute Respiratory Distress Syndrome (ARDS) and Acute 
Lung Injury (ALI) affect 100,000-150,000 patients each 
year in the United States alone [1,2]. ALI is an inflamma- 
tory syndrome characterized by acute respiratory failure 
due to non-cardiogenic pulmonary edema and hypoxemia
[3]. Oxidant stress caused by reactive oxygen species
(ROS) is thought to be a major contributor to the
pathogenesis of ALI. ROS can be generated by inflammatory
cells or pulmonary endothelium and cause damage to 
proteins, DNA, and lipids [4]. 

The risk of developing ALI/ARDS is not uniformly dis- 
tributed in the critically ill population, suggesting a 
genetic influence on outcomes [5]. Peroxiredoxins are a 
superfamily of non-heme and non-selenium peroxidases 
that are widely distributed throughout all phyla [6]. The 
Peroxiredoxin 6 gene (PRDX6) is located on chromosome
1q24 and is approximately 12 kb in length, containing 5
exons. The Prdx6 protein encoded by PRDX6 is involved
in redox regulation of the cell and has been shown in cell
and animal models to be protective against oxidative
injury [7]. Prdx6 also has phospholipase A2 activity and
has an important role in lung surfactant metabolism [7].
Prdx6 has also been shown to affect the cellular level of
H2O2 produced in cells stimulated with platelet-derived
growth factor or tumor necrosis factor-α, and to modulate
signaling induced by those ligands [8], indicating that
Prdx6 can affect cytokine levels and cell signaling
cascades. Recent studies suggest that Prdx6 is only active
following heterodimerization with glutathione-S-transferase
pi (GSTpi), indicating an important interaction between
Prdx6 and GSTpi [6].
Despite these important functions, little is known about 
genetic variation within the PRDX6 gene [9] and its asso- 
ciation with ALI. 

In order to determine if variation within PRDX6 is
associated with ALI risk in either the African American
(AA) or European American (EA) populations, we performed
direct sequencing of the 5' untranslated region (UTR),
exons, introns, and the 3' UTR in 48 subjects (25
African Americans and 23 European Americans) and
identified 80 variants, many of which have not been
previously reported. Eighteen of the eighty variants, along
with 19 tagging SNPs selected using HapMap
(http://hapmap.ncbi.nlm.nih.gov/), were tested for
association with ALI using a custom genotyping platform.

Methods 

Patient population 

Between 1999 and 2006, patients were enrolled in a major 
trauma cohort study designed to study molecular risks for 
acute lung injury [10-12]. Participants met the following
inclusion criteria: 1) admission to the intensive care unit
(ICU) as a result of acute trauma directly from the field or
via the hospital's Emergency Department; and 2) an
Injury Severity Score (ISS) > 16 as calculated on the basis
of information available during their first 24 hours of
hospitalization. The following demographic and clinical
variables were collected upon admission to the ICU: age,
gender, ISS, blunt mechanism, and Acute Physiology and
Chronic Health Evaluation (APACHE) score (Table 1). Exclusion
criteria were death or discharge from the ICU within
24 hours of admission, age less than 13 years, current
or past evidence of congestive heart failure (CHF) or
recent acute myocardial infarction, severe chronic
respiratory disease, morbid obesity, burns on over 30% of
the total body surface area, and lung or bone marrow
transplant [10].

The definition of ALI was in accordance with the
American European Consensus Conference (AECC) [3].
ALI and ARDS were defined as: acute onset; bilateral
pulmonary infiltrates on chest X-ray consistent with
pulmonary edema; absence of evidence of left atrial
hypertension; and poor systemic oxygenation, with a
ratio of arterial oxygen (PaO2) to the fraction of
inspired oxygen (FiO2) less than or equal to 300 for ALI
and 200 for ARDS [3]. All chest X-rays were reviewed
independently by 2 trained observers. In our population,
greater than 85% of subjects meeting criteria for ALI
also met criteria for ARDS.

Table 1 Clinical data for individuals enrolled in the study by ancestry

                                        ALI            No ALI
African Americans (N = 285)             (N = 71)       (N = 187)
  Age (± SD)                            36.7 ± 17.5    31.9 ± 12.4
  Gender (% of males)                   32             [illegible]
  ISS (± SD)                            25.3 ± 8.9     22.8 ± 6.3
  Blunt Mechanism (% of Blunt Trauma)   25             59
  APACHE (± SD)                         45.4 ± 12.4    40.2 ± 13.5
European Americans (N = 269)            (N = 86)       (N = 168)
  Age (± SD)                            41.8 ± 19.9    39.3 ± 18.5
  Gender (% of males)                   27             61
  ISS (± SD)                            26.9 ± 7.5     25.2 ± 7.2
  Blunt Mechanism (%)                   [illegible]    7
  APACHE (± SD)                         41.9 ± 14.7    35.7 ± 12.8

Clinical Data and Biosample Collection 

Clinical data were collected by trained study nurses 
using a standardized research case report form designed 
for the trauma cohort study. Blood for analysis was
obtained from residual blood samples in tubes containing
ethylenediaminetetraacetic acid (EDTA) that had
been previously drawn for other clinical purposes. Study
personnel collected residual samples each day, centrifuged
them, and separated the buffy coat layers, which were
frozen at -80°C [10]. All clinical and biosample collection
protocols were approved by the institutional review
board (IRB) at the University of Pennsylvania School of
Medicine under a waiver of informed consent.

PRDX6 resequencing 

Genomic DNA was extracted from whole blood using
Qiagen QIAamp DNA Blood Midi Kits (Qiagen USA) and
stored in the provided tris-EDTA buffer. DNA from 25
African American and 23 European American subjects
from the major trauma cohort, with ALI status
equally distributed within each group, was selected for
sequencing of PCR fragments, providing a power of 99%
to detect minor allele frequencies of at least 5% [13].
PCR primers for 4 kb upstream of the ATG start site,
all exons, introns, and 4 kb of the 3' UTR were
designed using PCRoverlap (Children's Hospital of
Philadelphia (CHOP) bioinformatics core) to generate
amplicons between 600 and 800 bp that overlapped by
at least 100 bp (Additional file 1). Following primer
optimization, DNA was amplified and sequenced in the
forward and reverse directions using a 3730 automated
sequencer at the CHOP Nucleic Acid and Protein Core
facility (Philadelphia, PA). Sequencher 4.8 (Gene Codes,
Ann Arbor, MI) was used to facilitate secondary peak
calls and to compare the sequence data to the NCBI
reference sequence.

Rushefski et al. BMC Medical Genetics 2011, 12:77
http://www.biomedcentral.com/1471-2350/12/77
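The 99% detection power quoted for the resequencing panel can be illustrated with a simple binomial argument. The sketch below is an assumption about how such a figure can arise, not necessarily the method of reference [13]: it treats the 2N chromosomes of N subjects as independent draws and asks how likely it is that an allele of a given frequency is seen at least once. The function name is hypothetical.

```python
def detection_power(n_subjects: int, maf: float) -> float:
    """Probability of observing a minor allele at least once.

    Assumes 2 * n_subjects independently sampled chromosomes and a
    population minor allele frequency `maf` (illustrative model only).
    """
    if not 0.0 <= maf <= 1.0:
        raise ValueError("maf must lie between 0 and 1")
    n_chromosomes = 2 * n_subjects
    # Chance that every sampled chromosome carries the major allele.
    p_missed = (1.0 - maf) ** n_chromosomes
    return 1.0 - p_missed


if __name__ == "__main__":
    # Panel sizes taken from the text: 48 subjects overall, 25 AA, 23 EA.
    for label, n in [("all subjects", 48), ("African American", 25),
                     ("European American", 23)]:
        print(f"{label:18s} n={n:2d}  power={detection_power(n, 0.05):.3f}")
```

Under this model the pooled panel of 48 subjects gives a power just above 0.99 for a 5% allele, while each ancestry group taken alone gives a somewhat lower value.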

SNP genotyping 

Novel variants with a minor allele frequency (MAF) of
0.04 or higher and tagging SNPs from HapMap
(http://hapmap.ncbi.nlm.nih.gov/) were validated using the
SNPlex genotyping system (Applied Biosystems Inc.,
Foster City, CA). Tagging SNPs were selected using
Tagger's pairwise testing methods described by de Bakker
and colleagues [14]. Genotyping novel variants not only
served to test for association, but also allowed us to validate
those SNPs in a larger population. Tagging SNPs were
also selected to provide better coverage of the haplotype
structure of PRDX6. SNPlex utilizes an oligonucleotide
ligation/PCR assay with universal ZipChute probe detection
to perform genotyping of up to 48 SNPs in a single
reaction. ZipChute probes were custom designed and
detected by capillary electrophoresis using the Applied
Biosystems 3130 Analyzer, and genotype calls were
determined using GeneMapper 4.0 (Applied Biosystems,
Foster City, CA).
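As an illustration of the MAF threshold used to choose novel variants for genotyping, the following minimal sketch computes a minor allele frequency from genotype counts and applies the 0.04 cutoff. The function names and the example counts are hypothetical; only the threshold and the panel size of 25 subjects come from the text.

```python
def minor_allele_frequency(n_hom_ref: int, n_het: int, n_hom_alt: int) -> float:
    """Minor allele frequency from diploid genotype counts."""
    n_chromosomes = 2 * (n_hom_ref + n_het + n_hom_alt)
    if n_chromosomes == 0:
        raise ValueError("no genotypes supplied")
    alt_frequency = (n_het + 2 * n_hom_alt) / n_chromosomes
    # Report the rarer of the two alleles.
    return min(alt_frequency, 1.0 - alt_frequency)


def passes_maf_threshold(maf: float, threshold: float = 0.04) -> bool:
    """True when the variant meets the 'MAF of 0.04 or higher' criterion."""
    return maf >= threshold


if __name__ == "__main__":
    # Hypothetical counts in a 25-subject panel (50 chromosomes).
    one_het = minor_allele_frequency(24, 1, 0)   # one heterozygote
    two_het = minor_allele_frequency(23, 2, 0)   # two heterozygotes
    print(one_het, passes_maf_threshold(one_het))
    print(two_het, passes_maf_threshold(two_het))
```

In a panel of 25 subjects a single heterozygote corresponds to a frequency of 0.02 and two heterozygotes to 0.04, so under these assumptions the cutoff amounts to requiring at least two copies of the minor allele in a group of that size.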

All genotyping was performed in the University of
Pennsylvania's Laboratory for Molecular Epidemiology
(LME). Staff were blinded to disease status, and genotyping
calls were performed in subsamples by plate.
Each plate contained six positive controls to test for
concordance. Genotyping calls were performed automatically
using the algorithm described by De La Vega and
colleagues [15].

In silico modeling of putative function in SNP sites 

We sought to test inferred function in silico using tran- 
scription factor binding and mRNA binding tools. TESS 
is a web-based software tool for locating possible tran- 
scription factor binding sites in DNA sequences using 
weight matrix models. It can also be used for browsing 
information about relevant transcription factors in the 
TRANSFAC database [16]. All SNPs discovered within
the 5' UTR and the first intron were submitted to TESS
as 21 base pair long FASTA sequences with the reference
allele of the SNP of interest in the 11th position. A
second search was performed using the alternative allele
in the 11th position. To eliminate poor matches due
to background noise, transcription factors with
log-likelihood scores (La) less than 12 were eliminated. TESS
results were compared with experimental transcription
factor binding site (TFBS) data registered in the
University of California Santa Cruz (UCSC) Genome
Browser by the Encyclopedia of DNA Elements
(ENCODE) consortium [17]. The ENCODE data were
filtered by chromosome and position.
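The construction of the paired TESS queries can be sketched as follows. This is a minimal illustration, assuming the reference sequence is available as a plain string; the function names, the toy sequence, and the record labels are hypothetical and are not taken from the study.

```python
def allele_windows(reference: str, snp_index: int, ref_allele: str,
                   alt_allele: str, flank: int = 10) -> tuple[str, str]:
    """Return (reference_window, alternative_window).

    Each window holds `flank` bases on either side of the SNP, so with
    flank=10 the SNP sits in the 11th position of a 21-base sequence.
    `snp_index` is the 0-based position of the SNP in `reference`.
    """
    if reference[snp_index].upper() != ref_allele.upper():
        raise ValueError("reference allele does not match the sequence")
    start, end = snp_index - flank, snp_index + flank + 1
    if start < 0 or end > len(reference):
        raise ValueError("not enough flanking sequence around the SNP")
    left = reference[start:snp_index]
    right = reference[snp_index + 1:end]
    return left + ref_allele + right, left + alt_allele + right


def to_fasta(name: str, sequence: str) -> str:
    """Format one sequence as a FASTA record."""
    return f">{name}\n{sequence}\n"


if __name__ == "__main__":
    toy_reference = "ACGTACGTACGTACGTACGTACGTA"  # made-up sequence
    ref_win, alt_win = allele_windows(toy_reference, 12, "A", "G")
    print(to_fasta("toy_snp_ref", ref_win), end="")
    print(to_fasta("toy_snp_alt", alt_win), end="")
```

Submitting the two records as separate queries mirrors the two searches described above, so that a binding motif present in only one of the results can be attributed to the allele change.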

A search query was performed for potential miRNA
binding sites in the 3' UTR of PRDX6 using Patrocles
(http://www.patrocles.org/). Patrocles is an online database
containing DNA sequence polymorphisms that are
predicted to interrupt miRNA-mediated gene regulation
[18]. The search was performed using "PRDX6" as a
keyword in the target gene id field, and miRNA target motifs
were defined by Xie et al. [19] and Lewis et al. [20].

Statistical Analysis of ALI association 

A total of 259 African American and 254 European American
subjects enrolled in the major trauma cohort were used to
test for association of novel variants and tagging SNPs
with ALI. Association of each PRDX6 SNP with ALI
was determined separately for European Americans and
African Americans using an additive model Chi-square test,
with a p-value < 0.0014 for African Americans considered
significant. Dominant and recessive inheritance
models were also tested using Chi-square analysis. Multivariable
analyses of potential confounding were performed
using logistic regression methods. Power was calculated
using the Power for Genetic Association analyses (PGA)
software [21]. Using PGA, we estimated that a sample size of 250
subjects per race category would provide 80% power to
detect relative risks of 2.26 or greater for SNPs with a
prevalence of 0.05 or greater and 1.93 or greater for
SNPs with a prevalence of 0.10 or greater, assuming a
Bonferroni-corrected alpha = 0.0014 for African Americans
and an incidence of ALI = 0.30 (Additional file 2).
These statistical analyses were performed using STATA
11 (StataCorp, College Station, TX). Pairwise
linkage disequilibrium was evaluated using Haploview
(http://www.broadinstitute.org/mpg/haploview). Genotypes
with a completion rate of 95% or greater were
considered for analysis in Haploview. LD was calculated
in terms of r² values, and blocks were defined using the
default algorithm, the confidence interval method
of Gabriel and colleagues [22].
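The significance threshold and the additive-model test can be illustrated with a short sketch. Two assumptions are made here and are not confirmed by the text: that the Bonferroni correction divides 0.05 by the 37 genotyped SNPs (which rounds to the stated 0.0014), and that the additive Chi-square test corresponds to a Cochran-Armitage trend test with genotype scores 0, 1, 2. The analyses in the study were run in STATA; the function names below are hypothetical.

```python
import math


def bonferroni_alpha(alpha: float, n_tests: int) -> float:
    """Per-test significance level after Bonferroni correction."""
    return alpha / n_tests


def additive_trend_test(case_counts, control_counts, scores=(0, 1, 2)):
    """Cochran-Armitage trend test on a 2 x 3 genotype table.

    case_counts / control_counts: subjects carrying 0, 1 and 2 copies of
    the minor allele. Returns (chi_square, p_value) on 1 degree of freedom.
    """
    n_cases = sum(case_counts)
    n_total = n_cases + sum(control_counts)
    column_totals = [a + b for a, b in zip(case_counts, control_counts)]
    sum_x = sum(s * n for s, n in zip(scores, column_totals))
    sum_xx = sum(s * s * n for s, n in zip(scores, column_totals))
    sum_case_x = sum(s * r for s, r in zip(scores, case_counts))
    numerator = n_total * (n_total * sum_case_x - n_cases * sum_x) ** 2
    denominator = (n_cases * (n_total - n_cases)
                   * (n_total * sum_xx - sum_x ** 2))
    if denominator == 0:
        raise ValueError("test undefined: no variation in genotype or status")
    chi_square = numerator / denominator
    # Upper tail of a 1-df Chi-square distribution.
    p_value = math.erfc(math.sqrt(chi_square / 2.0))
    return chi_square, p_value


if __name__ == "__main__":
    alpha = bonferroni_alpha(0.05, 37)
    print(f"corrected alpha = {alpha:.4f}")
    # Hypothetical genotype counts, for illustration only.
    chi2, p = additive_trend_test((40, 25, 6), (120, 55, 12))
    print(f"chi-square = {chi2:.3f}, p = {p:.4f}, significant = {p < alpha}")
```

Scoring genotypes 0, 1, 2 encodes the additive assumption that each extra copy of the minor allele shifts risk by the same amount; dominant or recessive models would use scores (0, 1, 1) or (0, 0, 1) instead.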

Haplotypes were inferred using the standard expectation
maximization algorithm in Haploview [23,24] and
the following confidence interval (CI) criteria: CI
minima for strong LD: 0.7 - 0.98; upper CI maximum
for strong LD: 0.98; fraction of strong LD in informative
comparisons > 0.95; and exclusion of markers with minor
allele frequency (MAF) < 0.05. Haplotypes were tested
for association with ALI first in a global association test,
which performed contingency testing using all haplotypes
of an LD block compared to no haplotypes, and
then as individual haplotypes versus ALI coded in an
additive fashion using PLINK [25]. Haplotype multiple testing
was addressed by applying permutation tests (10,000
permutations).
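The idea behind permutation-based correction for multiple haplotype tests can be sketched as below. This is a simplified illustration under stated assumptions, not a reproduction of the PLINK procedure: case labels are shuffled, a test statistic is recomputed for every haplotype, and each observed statistic is compared with the largest statistic in each shuffled data set. The statistic used here (absolute difference in mean haplotype dosage between cases and controls) and all names are hypothetical choices made for brevity.

```python
import random


def dosage_difference(dosages, labels):
    """Absolute difference in mean haplotype dosage, cases vs controls."""
    cases = [d for d, is_case in zip(dosages, labels) if is_case]
    controls = [d for d, is_case in zip(dosages, labels) if not is_case]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    return abs(sum(cases) / len(cases) - sum(controls) / len(controls))


def max_statistic_permutation(haplotype_dosages, is_case,
                              n_permutations=10000, seed=0):
    """Family-wise adjusted p-values from a max-statistic permutation test.

    haplotype_dosages: one list per haplotype, holding each subject's
    copy number (0, 1 or 2) of that haplotype.
    """
    observed = [dosage_difference(d, is_case) for d in haplotype_dosages]
    rng = random.Random(seed)
    labels = list(is_case)
    exceed = [0] * len(observed)
    for _ in range(n_permutations):
        rng.shuffle(labels)  # break any link between haplotype and status
        null_max = max(dosage_difference(d, labels)
                       for d in haplotype_dosages)
        for i, stat in enumerate(observed):
            if null_max >= stat:
                exceed[i] += 1
    # Add-one correction keeps the estimate away from exactly zero.
    return [(count + 1) / (n_permutations + 1) for count in exceed]


if __name__ == "__main__":
    status = [True] * 10 + [False] * 10
    hap_a = [2] * 10 + [0] * 10      # toy haplotype seen only in cases
    hap_b = [1, 0] * 10              # toy haplotype unrelated to status
    print(max_statistic_permutation([hap_a, hap_b], status,
                                    n_permutations=2000))
```

Comparing against the maximum over all haplotypes in each permutation is what controls the family-wise error rate, since a haplotype is only declared significant if it beats the best result that chance alone produces across the whole block.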

Results 

Identification of novel polymorphisms in PRDX6 

Direct sequencing of the PRDX6 gene in 48 subjects 
revealed 80 genetic variants, none of which were in cod- 
ing regions (31 in the 3' UTR, 22 in the 5' UTR and 27 
intronic) (Additional file 3). The variants identified via 
direct sequencing were compared with those registered 
in the NCBI dbSNP database (Build 130) and 43 were 
found to be novel SNPs (Table 2). Thirty seven were
matched with SNPs catalogued in dbSNP (Build 130)
and Genewindow (http://genewindow.nci.nih.gov/Welcome)
based on chromosome position (Table 3). Twenty
five of the novel SNPs uncovered had a MAF > 0.04 and
were submitted to the NCBI to be catalogued and
assigned ss numbers in the submitter records section of
dbSNP (Table 2). Novel SNPs were also compared with
SNPs registered in the 1000 Genomes database. Thirty
six out of thirty seven known SNPs overlapped with
SNPs registered in 1000 Genomes, but only sixteen out
of forty-three novel SNPs identified via our sequencing
effort were also registered in 1000 Genomes (Table 4).
Several variants were only observed in one individual.
As a quality control measure we present the confidence
scores for these genotypes in Additional file 4. Confidence
scores are reported as the percentage of overlap
between heterozygote peaks. Previous studies indicated
that two transcription factor binding sites, the ARE1 
(-357 to -349) [26] and GRE2 (-750 to -738) (A. Fisher, 
unpublished observations), may play a role in the regula- 
tion of PRDX6. We were unable to sequence the ARE1 
region and portions of the intronic regions due to the 
GC rich content of the flanking sequence (Figure 1). 
The GRE2 region was successfully sequenced, but 
showed no variation. 

In Silico function of novel SNPs in PRDX6 

The TESS results showed several potential transcription 
factor binding motifs in both the reference and alterna- 
tive sequence. The reference and alternative sequences 
were submitted as independent queries and transcrip- 
tion factors were returned for 19 positions in the refer- 
ence sequence and 21 positions in the alternative 
sequence (Table 5). Twenty seven out of twenty nine 
sequences submitted were shown to create, abolish, or 
change a transcription factor binding site. Fourteen of 
these SNPs were novel. Comparison of the transcription
factors returned from the TESS query with the data
from ENCODE showed that only 3 of these putative
transcription factor binding sites have been tested by
the ENCODE consortium: SP1, GATA-1, and c-Myc.
ENCODE data for SP1, GATA-1, and c-Myc revealed
no evidence of binding affinity with the
sequence results from the PRDX6 gene when filtered for
PRDX6 and A549 cells.

A Patrocles miRNA database search for PRDX6 
revealed eight SNPs in the 3' UTR of PRDX6 as poten- 
tial miRNA binding sites (Table 6). Of the eight SNPs 
returned from the search query, three matched SNPs 
from this study (rs4611, rs36005931, and rs2000). 
rs4611 and rs36005931 are located within octamers that 
have been conserved among several species, but do not 
correspond to a known miRNA. The G allele of rs2000 
is part of an octamer capable of binding miR-942. A lit- 
erature search for miR-942 returned only sequence data, 
with no known function to date. 

Association of PRDX6 with ALI 

The trauma cohort described in Table 1 was genotyped
for 37 PRDX6 SNPs using SNPlex. All SNPs were tested
for Hardy-Weinberg equilibrium (Additional file 5). Chi-square
analysis of incidence of ALI compared to genotype
using an additive model showed no significant association
between any of the SNPs in this study and ALI
(Table 7). Dominant and recessive models failed to
demonstrate an association between our SNPs of inter- 
est and ALI (Additional file 6). The genotype concor- 
dance rate based on assay positive controls was 100% 
and the frequency of missing genotypes is presented in 
Table 7. Logistic regression analysis after adjustment for 
age and ISS showed no association between ALI and 
our SNPs (Table 8). 
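A Hardy-Weinberg equilibrium check of the kind referred to above can be sketched with a one-degree-of-freedom Chi-square goodness-of-fit test. The text does not state which HWE test was used (results are in Additional file 5), so the choice of the Chi-square form, the function name, and the example counts are assumptions.

```python
import math


def hwe_chi_square(n_aa: int, n_ab: int, n_bb: int):
    """Chi-square test of Hardy-Weinberg proportions.

    n_aa, n_ab, n_bb: observed counts of the three genotypes.
    Returns (chi_square, p_value) on 1 degree of freedom.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    # Skip classes with zero expectation (monomorphic markers).
    chi_square = sum((o - e) ** 2 / e
                     for o, e in zip(observed, expected) if e > 0)
    p_value = math.erfc(math.sqrt(chi_square / 2.0))
    return chi_square, p_value


if __name__ == "__main__":
    # Hypothetical genotype counts, for illustration only.
    print(hwe_chi_square(180, 68, 6))
    print(hwe_chi_square(50, 0, 50))   # no heterozygotes: strong departure
```

A marked deficit or excess of heterozygotes relative to expectation gives a large statistic, which in genotyping studies is often read as a warning sign of assay error rather than as a biological finding.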

Haplotype Analysis 

Haplotype blocks were created for both African and
European Americans using 27 and 28 SNP markers,
respectively. Haplotype blocks were created for a region
spanning 100.6 kb of chromosome 1. For African Americans,
14 SNP markers had a genotyping completion rate of less
than 95% and were thus excluded from the haplotype
analysis. For European Americans, 9 SNPs had a genotype
completion rate of less than 95% and were excluded from
the haplotype analysis.

In African Americans, there were 2 blocks (block1 =
rs4916362 and rs10753081; block2 = PRDX6_171711459,
rs34619706, hCV9040425, rs35244306, rs9425725,
rs912767, rs2000, hCV1948447, rs6702828, rs6702835,
rs7521536, rs7529377) (Figure 2). In European Americans,
there were 2 blocks (block1 = rs4916362, rs10753081,
PRDX6_171711459, rs34619706; block2 = rs33942654,
PRDX6_171715019, hCV9040425, rs35244306, rs9425725,
rs912767, rs2000, hCV1948447, rs6702835, rs7521536,
rs7529377) (Figure 3). The haplotype structure of PRDX6
Table 2 Novel SNPs discovered via direct sequencing

Novel SNPs        Region   Chr. Position  SNP          5' Flanking Sequence  3' Flanking Sequence  MAF in EA  MAF in AA  ss# (dbSNP)
PRDX6_171709541   5' UTR   171709541      G/C          CTTCAAGGTTC           AGGGTTATAGG           0.04       0.02       ss217326279
PRDX6_171709910   5' UTR   171709910      G/T          ATGATCATTTTT          GAAATATAGAG           0.00       0.10       ss217326288
PRDX6_171710327   5' UTR   171710327      C/T          ACCCTAGCCCC           TGTGGTGGGA            0.04       0.00       ss217326273
PRDX6_171710490   5' UTR   171710490      C/T          TGCACTGCGGA           GGAGGGAGGT            0.00       0.06       ss217326283
PRDX6_171710775   5' UTR   171710775      G/C          CTTATGGGTGG           GTGAGAGATG            0.00       0.02       -
PRDX6_171710821   5' UTR   171710821      C/T          A?TGCACTGAG           TTGTGTAAAGT           0.00       0.10       ss217326292
PRDX6_171711029   5' UTR   171711029      C/T          AGTGAGAGAGG           GGGTGGTGGG            0.00       0.02       -
PRDX6_171712345   5' UTR   171712345      A/T          ATGGTTGATAA           AGAAAGGGGA            0.87       0.64       ss217326296
PRDX6_171713694   intron   171713694      G/T          TGAGTTGGGGG           AGTGGGGAGG            0.00       0.04       ss217326300
PRDX6_171713738   intron   171713738      G/T          GGTGGGTTGTG           TGGTGGGTGG            0.00       0.02       -
PRDX6_171713872   intron   171713872      C/T          GGAGAAAATGT           TAAAAGGAGTA           0.00       0.12       ss217326323
PRDX6_171713919   intron   171713919      G/C          AAAGAGTTTTTG          AGGGGGGTGG            0.02       0.00       -
PRDX6_171714107   intron   171714107      C/T          GGAGGAGAGGT           TGGGGAAGTTT           0.00       0.04       ss217326304
PRDX6_171714984   intron   171714984      C/T          GATGAAAAGTG           TTATGAGGGAG           0.04       0.04       ss217326307
PRDX6_171715019   intron   171715019      A/G          AGGAAGAGGGT           TATGTGGATTT           0.00       0.10       ss217326318
PRDX6_171715596   intron   171715596      A/G          GGGAGGGAAG            TGAAGTGGGTT           0.00       0.02       -
PRDX6_171716007   intron   171716007      A/G          AAAGGTTGGGA           GTGGGAGGGG            0.00       0.04       ss217326311
PRDX6_171716032   intron   171716032      G/C          TAAGTAGGAAG           TGGGGTTGTGT           0.00       0.02       -
PRDX6_171716554   intron   171716554      A/C          AGAAGGGAAGT           AAGTTTAATTTT          0.00       0.02       -
PRDX6_171716572   intron   171716572      A/T          TGAAGTTTAATT          TAAATAGAAGA           0.00       0.02       -
PRDX6_171716582   intron   171716582      A/G          TTAGATATAAAT          ATAGAAAGGTA           0.00       0.04       ss217326315
PRDX6_171716584   intron   171716584      A/?          AGATATAAATAG          AGAAAGGTATT           0.00       0.02       -
PRDX6_171716603   intron   171716603      A/G          AAGGTATTTATT          ATTAGATAATTT          0.00       0.02       -
PRDX6_171723151   intron   171723151      C/T          AAAGGTAGGAT           TGGAGAAGAA            0.00       0.02       -
PRDX6_171723403   intron   171723403      C/T          GTTGATTAGTGT          AGGAGGTGTAG           0.00       0.02       -
PRDX6_171723889   3' UTR   171723889      G/T          AAAAGTGAAAT           GGATGTGTGGA           0.00       0.04       -
PRDX6_171723918   3' UTR   171723918      A/G          GGTTGTGAGGA           GTGATATTTGT           0.00       0.02       -
PRDX6_171724000   3' UTR   171724000      G/C          TAAGTGTGGTAT          TGGTGTGGTGT           0.00       0.04       ss217326227
PRDX6_171724128   3' UTR   171724128      G/T          TTTTTTTAATAT          TGATGAGAGAA           0.00       0.04       ss217326232
PRDX6_171724182   3' UTR   171724182      A/G          GATATTGTTTTA          TGTTGATGAGA           0.00       0.04       ss217326236
PRDX6_171724286   3' UTR   171724286      A/T          TTGGTATAAAAA          TTTGTGATAAG           0.00       0.02       -
PRDX6_171724949   3' UTR   171724949      C/T          AGTGTAGTAATA          GAGGTTTAGAA           0.26       0.00       ss217326270
PRDX6_171725122   3' UTR   171725122      G/T          GGAGGTGGTTG           TTGTAGTTTGG           0.00       0.04       ss217326239
PRDX6_171725183   3' UTR   171725183      C/T          GGGATGATGGG           GTGTGATAAGG           0.00       0.04       ss217326242
PRDX6_171725257   3' UTR   171725257      A/T          GGTGGGAAAGG           GATGGAAATAG           0.00       0.04       ss217326247
PRDX6_171725485   3' UTR   171725485      [illegible]  [illegible]           [illegible]           0.00       0.06       ss217326250
PRDX6_171725681   3' UTR   171725681      G/T          ATATTTTTATTG          TAGAATAATGT           0.00       0.02       -
PRDX6_171725826   3' UTR   171725826      A/G          TCTGGGGAATG           TTTGAAAGAGA           0.04       0.00       ss217326254
PRDX6_171727146   3' UTR   171727146      G/C          CTGTGATTCCT           TTGTGGTCTTG           0.02       0.00       -
PRDX6_171727831   3' UTR   171727831      C/T          ATGCATGGGAT           ATTATCCTCTA           0.04       0.02       ss217326261
PRDX6_171728416   3' UTR   171728416      G/T          CCTCATTAGGG           CTCTTAGCCCT           0.04       0.02       ss217326266
PRDX6_171728455   3' UTR   171728455      G/C          AATCGGGAGGC           TGTTAACAGGT           0.00       0.04       ss217326257
PRDX6_171729049   3' UTR   171729049      C/T          GTTCTTAAACTA          AATAGCATGAG           0.00       0.02       -
Table 3 Known SNPs discovered via direct sequencing

Known SNPs    Region   Chr. Position  SNP          MAF in EA (discovery)  MAF in AA (discovery)  MAF in EA (dbSNP)  MAF in AA (dbSNP)
rs1337?447    5' UTR   171709896      A/G          0.00                   0.06                   0.01               0.09
rs10753081    5' UTR   171710134      C/T          0.39                   0.30                   0.73               0.21
rs34282???    5' UTR   171710819      [illegible]  0.07                   0.00                   NA                 NA
rs9425722     5' UTR   171711268      C/T          1.02                   0.70                   0.04               0.24
rs?5152701    5' UTR   171711269      A/G          0.02                   0.00                   0.00               0.00
rs12739142    5' UTR   171711278      A/G          0.04                   0.00                   0.02               0.04
rs4354572     5' UTR   171711459      [illegible]  1.02                   0.66                   0.00               0.04
rs34619706    5' UTR   171711670      A/G          0.09                   0.02                   0.07               0.04
rs43?27?6     5' UTR   171711699      C/T          0.54                   0.28                   0.30               0.50
rs13376392    5' UTR   171711701      C/T          0.00                   0.06                   0.00               0.04
rs1157?174    5' UTR   171712?01      G/T          0.04                   0.00                   0.15               0.00
rs34977864    5' UTR   171712466      G/T          0.00                   0.06                   0.00               0.10
rs35441546    5' UTR   171712??1      C/T          0.00                   0.06                   0.00               0.03
rs3?133733    5' UTR   171712788      C/T          0.00                   0.06                   0.00               0.00
rs??71141     intron   171713337      G/T          0.26                   0.26                   0.24               0.30
rs35749242    intron   171714171      A/G          0.00                   0.12                   0.00               0.03
rs33918328    intron   171714199      A/G          0.00                   0.02                   0.00               0.10
rs35899?98    intron   171714279      C/T          0.00                   0.10                   0.00               0.00
rs33942654    intron   171714884      A/G          0.26                   0.18                   0.23               0.33
rs3357990?    intron   171715013      A/G          0.00                   0.02                   0.00               0.00
rs9425723     intron   171715118      A/G          0.33                   0.62                   0.23               0.63
rs9425724     intron   171715128      A/G          0.33                   0.54                   0.23               0.53
rs734???3     intron   171715659      A/G          0.33                   0.56                   0.23               0.53
rs35244306    intron   171716247      C/T          0.00                   0.14                   0.00               0.07
rs4611        3' UTR   17172?640      [illegible]  0.33                   0.50                   0.77               0.47
rs3033330     3' UTR   171724172      [illegible]  0.00                   0.00                   0.05               0.04
rs7314        3' UTR   171724???      A/G          0.20                   0.50                   0.23               0.52
rs36005931    3' UTR   171724224      A/G          0.00                   0.04                   0.00               0.10
rs2000        3' UTR   171724457      A/G          0.04                   0.00                   0.03               0.00
[illegible]   3' UTR   171724720      [illegible]  0.11                   0.00                   0.02               0.03
rs9425727     3' UTR   171725216      G/C          0.02                   0.14                   0.00               0.04
rs35358649    3' UTR   171725429      C/T          0.02                   0.00                   0.00               0.00
rs35547740    3' UTR   171725569      [illegible]  0.00                   0.04                   0.00               0.07
rs6702835     3' UTR   171725723      A/G          0.33                   0.48                   0.23               0.54
rs60587131    3' UTR   171726777      G/C          0.00                   0.02                   0.05               NA
rs57032935    3' UTR   171726836      G/C          0.15                   0.22                   0.18               0.31
rs6664925     3' UTR   171729033      A/G          0.00                   0.18                   0.00               0.16

appears to be in low LD in both subgroups. Haplotype
analyses of association with ALI did not reveal any significant
associations beyond the single SNP analyses (Table 9).

Discussion 

Prdx6 is a member of the thiol-specific antioxidant
protein family and in overexpressing cell and mouse
models has been shown to be protective against oxidant
stress, while null models show sensitivity to
oxidants [7,9,27]. Thus, PRDX6 is a suitable candidate
gene for ALI risk. The extent of genetic variation
within PRDX6 remains largely unknown; therefore, we
performed direct sequencing of the PRDX6 gene and
identified novel variants for future study. We also
tested the newly discovered SNPs and tagging SNPs
for association with ALI using our trauma cohort, and
did not demonstrate an association with trauma-related
ALI.
Table 4 Comparison of PRDX6 discovery SNPs with 1000 Genomes database



PRDX6 SNPs 


1000 Genomes Comparison 


PRDX6 SNPs


1000 Genomes Comparison 


nnnv/" 1 71 7aaai a 

rKUXo_ i /i /oyyi 0 


Matched 


nnnv/" 171 7 a at a 1 

rKDX6_ I / I /09b4 I 


Not Matched 


nnnv/' 1 71 71 1071 

rKDXo_ 1/1/1 88/z 


Matched 


nnnv/' 17171A117 

rKDX6_l / I /1 082/ 


Not Matched 


rKDXo_l /I /1 41 0/ 


Matched 


nnnv/ - ni7in/ a a 

rKDX6_ 1 / 1 / 1 0490 


Not Matched 


rKUXo_l /I /zolol 


Matched 


nnPiV/" 1 71 71 A77C 

rKDX6_ I /I / I 0//b 


Not Matched 


nnnv/ - i7minio 

rKDXo_l /l /Zd9\o 


Matched 


nnnv/" 1 71 71 non 

rKDX6_l / 1 /10821 


Not Matched 


rKDXo_ I / I /24000 


Matched 


nnnv/' 1 71 71 1 mn 

rKDX6_l /I /I 1029 


Not Matched 


rKDXo_l /I /24128 


Matched 


rKDX6_l / 1 / 1284b 


Not Matched 


rKDXo_l / I /z418z 


Matched 


nnpiv/ - 1 71 71 i£n/ 

rKDX6_l /I /I J6y4 


Not Matched 


rKDXo_l /] /zolzz 


Matched 


nnnv/ - 1 71 71 1710 

rKDX6_l / 1 /13/38 


Not Matched 


rKDXo_l /] /zo!83 


Matched 


nnnv/ - 171711A1A 

rKDX6_l /I /13919 


Not Matched 


rKDXo_ 1 / 1 /zozo/ 


Matched 


nnnv/ - 1 71 71 aooa 

rKDX6_ I / I / 1 4984 


Not Matched 


rKDXo_ 1 /I /zbozo 


Matched 


nnnv/" 1 71 71 mm 

rKDX6_l /I /Ib0l9 


Not Matched 


nnnv/" 1 717171 

rKDXo_ I /l /z/l4o 


Matched 


nnnv/" 1 71 71 ncoc 

rKDX6_l / I / Ibb96 


Not Matched 


nflPiV/' 17171707)1 

rKDXo_ I /I /z/8Jl 


Matched 


nnP\V/" 17171/~AA7 

rKDX6_l /I /1 600/ 


Not Matched 


nnnv/" 171710/1^ 

rKDXo_ I / I /z84 1 0 


Matched 


nnnv/" 1 71 71 /Tin 

rKDX6_l / 1 / 1 6082 


Not Matched 


nnnv/' 1 71 71Q A ETC 

rKUX6_ 1 / I /z84bb 


Matched 


rKDX6_l /I /16bb4 


Not Matched 


1 A 7 rinm 

rs!0/b808l 


Matched 


nnnv/ - 1 71 71 /T7i 

rKDX6_l / 1 /16b/2 


Not Matched 


„i ir 7»" 1 71 

rsl lb/61 /4 


Matched 


rKDX6_l / I /16b82 


Not Matched 


„-1 171/11 /II 

rslz/39142 


Matched 


nnnv/" 1 71 71/T0/ 

rKDX6_ I / I / I 6b84 


Not Matched 


,,ii') 7/- im 


Matched 


rKDX6_l /I /1 6608 


Not Matched 


„1 11 —if A A~! 

rs 1 88/644/ 


Matched 


nnnv/ 1 71 711/1 ni 

rKDX6_l / 1 /28408 


Not Matched 


1 AAA 

rszOOO 


Matched 


nnpiv/ - 1 71 71100A 

rKDX6_l /I /28889 


Not Matched 


„.111/| 1/" r ^ 

rs889426b4 


Matched 


nnnv/" 1 71 71/110^ 

rKDX6_ 1 / 1 / 24286 


Not Matched 


rs841 29b68 


Matched 


nnnv/" ni 7i^njin 

rKDX6_l / 1 / 24949 


Not Matched 


rs84282688 


Matched 


nnnv/ - 1 71 7ir/nr 

rKDX6_ 1 / 1 /2b48b 


Not Matched 


.•*•") AC 1 A7 A/~ 

rs846l 9/06 


Matched 


nnnv/ - 1 71 7ir^:oi 

rKDX6_ I / I /2b68l 


Not Matched 


1 / A770/~ /I 

rs849//864 


Matched 


nnnv/ 1 71 7iaa/a 

rKDX6_ 1 / 1 /29049 


Not Matched 


rsobl 33/3b 


Matched 


rs8888b86 


Not Matched 


, r iri r 17A 1 

rsobl bz/Ul 


Matched 






rs8b244806 


Matched 






, r irir o/" A A 

rs8b8b8649 


Matched 






C A A i r AT 

rs8b441 b46 


Matched 






Kr1C(T/l77^A 

rs3bb4//40 


Matched 






l-^ 1 7AAAO 

rs8b6/9908 


Matched 






, r ir7/i Al /1 1 

rs8b/49242 


Matched 






rr- 1 C O AA/~ AO 

rs8b899698 


Matched 






, r irni 0110 

rs8b9 1 8828 


Matched 






rs86UUb98 i 


Matched 






rs4354572 


Matched 






rs4382766 


Matched 






rs4611


Matched 






rs57032935 


Matched 






rs60587131 


Matched 






rs6664925 


Matched 






rs6671141


Matched 







rs6702835 Matched 



rs7314



Matched 



rs7540065 


Matched 


rs9425722 


Matched 


rs9425723 


Matched 


rs9425724


Matched 


rs9425727 


Matched 



We identified 43 novel variants among African American
and European American subjects with either ALI or
control status. None of the 43 SNPs identified were in
coding regions, which may indicate that the Prdx6 protein
is highly conserved across phyla. Approximately 19
kb on chromosome 1 was sequenced in order to achieve
adequate coverage of the PRDX6 gene and flanking 5'
and 3' UTRs. Special attention was given to the GRE2
and ARE1 regions, at -749 to -737 and -357 to -349,
respectively. The ARE1 within the PRDX6 promoter was
shown to play a role in regulation of transcription and
to be inducible under conditions of oxidative stress [26],
and the GRE2 may be capable of binding transcription
factors under oxidative stress conditions [28]. Due to
the GC-rich content of the region surrounding the
ARE1, we were unable to optimize PCR conditions
in a way that primed through the secondary structure.
The GRE2 region was sequenced, but no variation was
noted. The GC-rich region within the PRDX6 promoter
might warrant further investigation, since methylation of
DNA cytosine residues is often found in the sequence
context CpG. Several new sequencing approaches are
emerging that target methylation sites using restriction
enzyme treatment followed by sequencing by synthesis
[29].

In addition to comparing our results with NCBI's
dbSNP, we compared our novel and known SNPs with
the resequencing data registered in 1000 Genomes. The
1000 Genomes project aims to find most genetic variants
with frequencies of at least 1%. Thus far three
sequencing projects contribute to the database: low coverage
sequencing of 179 individuals from 4 populations,
high coverage sequencing of 2 mother-father-child trios,
and exon-targeted sequencing of 697 individuals from 7
populations [30]. Although 1000 Genomes aims to identify
over 95% of variation in any individual, 27 of our
novel SNPs and 1 previously recorded SNP are not present
in the database, signifying a need for resequencing
of extreme phenotypes, such as ALI cases.

Novel and previously recorded SNPs in the 5' UTR
and first intron of PRDX6 were submitted to TESS to
determine their likelihood of being in transcription factor
binding sites. We found 19 motifs in the reference
sequences that are capable of binding known transcription
factors and 21 in the alternative sequences. A comparison
between the results of the reference sequence
search and the alternative search revealed that in most cases
the SNP of interest changes the motif enough either to cause a
different transcription factor to bind that site or to
make a binding site disappear or appear. After
comparison with the ENCODE data, we found that our
sequences have not yet been shown experimentally to bind
the three overlapping transcription factors tested in ENCODE.

Known SNPs validated in the sequencing effort were
checked against a Patrocles search query for miRNA
target sites within PRDX6 to determine whether any of our
SNPs were in putative target sites for miRNAs. Three of
the eight SNPs returned from the search corresponded
with our known SNPs. Only one of the three SNPs was
found to have a corresponding known miRNA (miR-942).
Some miRNAs are known to control the expression
of genes at the posttranscriptional level [31]. However,
very limited data are available on miR-942.
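
As an illustration of the kind of check described above, the following Python sketch lists the 8-mers that overlap a SNP under each allele and reports which octamers from a target set are created or destroyed. It is not the Patrocles implementation. The flanking sequence and the assignment of the site to the G allele are invented for the example; only the octamer AGAGAAGA (listed for rs2000 and hsa-miR-942 in Table 6) comes from the paper.

```python
def octamers_overlapping(seq, pos, k=8):
    """Return every k-mer in seq that covers the 0-based position pos."""
    start = max(0, pos - k + 1)
    end = min(pos, len(seq) - k)
    return {seq[i:i + k] for i in range(start, end + 1)}


def seed_site_changes(flank5, ref, alt, flank3, targets):
    """Compare target octamers overlapping the SNP under each allele."""
    pos = len(flank5)  # index of the SNP in the assembled sequence
    ref_hits = octamers_overlapping(flank5 + ref + flank3, pos) & targets
    alt_hits = octamers_overlapping(flank5 + alt + flank3, pos) & targets
    return {"destroyed": ref_hits - alt_hits, "created": alt_hits - ref_hits}


# Hypothetical flanks around a G/A SNP; the target octamer is from Table 6.
changes = seed_site_changes("TTCAGAGAA", "G", "A", "ACCT", {"AGAGAAGA"})
print(changes)
```

A site that appears under "destroyed" is present only with the reference allele, and one under "created" only with the alternative allele. Which allele carries the site in the real 3'UTR would have to be read from the actual sequence.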

We performed an association study for ALI using
newly uncovered SNPs and SNPs selected from HapMap
and NCBI's dbSNP, and observed no significant
association between any of the SNPs in this study and



[Figure 1 schematic: PRDX6 gene with the GRE2 and ARE1 promoter regions and Exons 1 to 5]

Figure 1 PRDX6 gene diagram with novel SNP positions. A schematic of the PRDX6 gene is presented with vertical lines and arrows above 
the gene indicating novel SNPs and regions of interest, respectively. The horizontal arrows below the schematic are representative of the 
regions of PRDX6 that were successfully sequenced. 
Table 5 Potential transcription factor binding sites within the PRDX6 gene



SNP 




Location 


Base Change 


Reference Sequence 


Alternative Sequence 


rKUXo_ 1 /I /Uyb41 




5' UTR 


G/C 




n/a 


Sp1, CACCC-binding factor, PuF


1 ~3 1 1 C A A~7 

rsl 55/o44/ 




5' UTR 


A/G 




GATA-1 


GATA-1 


nnnvi' 1 "71 7nnm r\ 

rKUXo_ I /i /uyyi 0 




5' UTR 


G/T 




n/a 


II 1; DL DD 

IL-o Kb-br 


rr- 1 n"7C 1 AO 1 




5' UTR 


C/T 




GATA-3 


GATA-1 


rKUXo_l /I /I03z/ 




5' UTR 


C/T




C/EBPbeta


C/EBPbeta


rKUXo_l /I /1U//5 




5' UTR 


G/C 




c-Myb 


n/a 


rsy4zD/2z 




5' UTR 


C/T




n/a 


c-Myc 


, r Tri r T7A 1 

rsJbl bi/Ul 




5' UTR 


A/G 




n/a 


~r n /1 r tdh ~t r i i ri 

IMF, I br, IFIIU 


rsl l/59\4l 




5' UTR 


A/G 




IPFl, Isl-l, IFF! 


IPF1 


rr- a ~> r a r 7i 

rs4/jb4b/z 




5' UTR 


C/T 




C/EBPbeta


Arm ATDn d 

AFP1, A I dFI-d 


rsl Ib/o I /4 




5' UTR 


G/T 




H4TF-1 


r~rr t\ a r TCI i r~\ "TDD 

b I F, I Mb, I HID, I dP 


nnriVA - 1 71 71 n/r 

rKUXo_ 1/1/1 zJ4b 




5' UTR 


A/T 




n/a 


bKY, ILb-4t 


rKUXo_ 1 / 1 / I Zd4d 




5' UTR 


A/T 




n/a 


SRY 


vr-l C A A CAT 

rsro-TI 1 d4d 




5' UTR 


C/T 




Sp1, ETF


CACCC-binding factor, Sp1


rsobloo/ib 




5' UTR 


C/T




n/a 


CACCC-binding factor, Sp1


rsoo/l I4l 




intron 


G/T 




GK alpha, rK, FK A, GK beta 


n/a 


nnnv/" 1 "71 "71 "D^oo 

rKUXo_l /I /li/^8 




intron 


G/T 




n/a 


TEF-1 


nnrivA - 1 -71 ~7i no - ?") 

rKUXo_ I /I /I io/z 




intron 


C/T 




n/a 


H4TF-1 


rsJby 1 oizo 




intron 


A/G 




IPF1 


n/a


rsJjy4zob4 




intron 


A/G 




c-Myb 


n/a 


nnnvi' 1 "71 "71 Ar\o a 

rKUXo_l /I /I4y84 




intron 


L/ 




TBP 


GATA-1 


PRHYin 1 71 71 ^mQ 
rnUAD_ 1 / 1 / 1 DU 1 y 




intron 


A/G 




GATA-3


GATA-1


rs9425723 




intron 


A/G 




TBP, TFIID


AP-1, c-Jun 


rs9425724 




intron 


A/G 




USF1 


AP-1, AP-4, CCK-la, c-Myc, CREB, Max1, USF1 


PRDX6_171715596 




intron 


A/G 




c-Myb, c-Ets-2 


n/a


PRDX6_171716007 




intron 


A/G 




n/a 


Sp1, C/EBPbeta, CACCC-binding factor


PRDX6_171716572 




intron 


A/T 




TMF, TFIID, ETF, TBP 


n/a 


PRDX6_171716582 




intron 


A/G 




GATA-1, GATA-3 


n/a 


PRDX6_171716584




intron 


A/T 




GATA-1, GATA-3 


SRY 


*Transcription factor binding site modeling was performed using TESS.




Table 6 PRDX6 SNPs thought to be miRNA target sites 




SNP ID 


Chromosome Position 


Base Change


Ancestral


Derived


miRNA


rs4611


17' 


1 723640 




T/G 


TTGGTGGT 


CTGGTGCT 


rs15268


17' 


723695 




C/A 




AGCAATTA hsa-miR-302f 
ATTACATA hsa-miR-380 


rs3211528


17' 


724201 




G/A 


CTGGGGGA 


hsa-miR-361-3p 


rs36005931 


1/ 


724224 




A/G 




GTGCCTTC 
TGTGCCTT 


rs35820016 


17 


724277 




T/A 


TTTTGCTA 


hsa-miR-548p 

TTGCAATA 


rs1804053


17 


724351 




G/A 




ATGTAGCA hsa-miR-221 
hsa-miR-222 


rs11544001


17' 


724433 




A/G 




GTGCATGA 
TGCATGAA 
TGTGCATG 


rs2000 


17' 


724457 




G/A 


AGAGAAGA 


hsa-miR-942 



*miRNA target site modeling was performed using Patrocles. 
Table 7 Association of 37 PRDX6 genotypes and risk of ALI using an additive model in a population of African and
European Americans with major trauma









                              European Americans                       African Americans
SNP               Source      MAF     MAF        Missing    p value    MAF     MAF        Missing    p value
                              (ALI)   (non-ALI)  Genotype              (ALI)   (non-ALI)  Genotype
                                                 Frequency                                Frequency
hCV1948447        Tagging     0.22    0.20       0.016      0.538      0.06    0.05       0.019      0.507
hCV25599136       Tagging     0.00    0.00       0.161      NA         0.03    0.03       0.230      0.842
hCV25599144       Tagging     0.00    0.01       0.399      0.999      0.00    0.00       0.296      NA
hCV9040425        Tagging     0.27    0.25       0.021      0.739      0.38    0.35       0.019      0.725
hCV9040434        Tagging     0.25    0.24       0.407      0.838      0.31    0.29       0.307      0.537
Position23855054  Sequencing  0.00    0.00       0.012      NA         0.09    0.08       0.019      0.535
Position23855203  Sequencing  0.00    0.00       0.679      NA         0.10    0.17       0.662      0.268
Position23859396  Sequencing  0.00    0.00       0.012      NA         0.04    0.02       0.019      0.302
Position23863249  Sequencing  0.00    0.00       0.012      NA         0.03    0.03       0.023      0.577
PRDX6_171709910   Sequencing  0.03    0.01       0.436      0.192      0.04    0.06       0.494      0.505
PRDX6_171711459   Sequencing  0.00    0.00       0.021      NA         0.11    0.18       0.043      0.088
PRDX6_171713872   Sequencing  0.00    0.01       0.037      0.999      0.08    0.05       0.070      0.331
PRDX6_171715019   Sequencing  0.00    0.00       0.029      0.999      0.05    0.05       0.058      0.879
PRDX6_171724949   Sequencing  0.22    0.19       0.029      0.455      0.07    0.05       0.043      0.481
rs10753081        Sequencing  0.34    0.29       0.008      0.246      0.37    0.34       0.000      0.676
rs2000            Tagging     0.05    0.05       0.045      0.966      0.02    0.02       0.031      0.793
rs33942654        Sequencing  0.23    0.20       0.021      0.321      0.34    0.29       0.054      0.138
rs34129563        Sequencing  0.07    0.03       0.021      0.080      0.01    0.01       0.039      0.559
rs34619706        Sequencing  0.08    0.08       0.037      0.900      0.03    0.01       0.047      0.287
rs35244306        Sequencing  0.00    0.01       0.021      0.999      0.11    0.08       0.039      0.517
rs35749242        Sequencing  0.00    0.01       0.021      0.999      0.08    0.06       0.039      0.516
rs4354572         Sequencing  0.00    0.00       0.354      NA         0.11    0.17       0.358      0.251
rs4382766         Sequencing  0.33    0.29       0.025      0.312      0.35    0.34       0.047      0.830
rs4916362         Tagging     0.34    0.29       0.021      0.308      0.35    0.32       0.019      0.670
rs57032935        Sequencing  0.23    0.19       0.037      0.296      0.34    0.29       0.082      0.131
rs6671141         Sequencing  0.23    0.19       0.210      0.421      0.38    0.33       0.163      0.170
rs6699179         Tagging     0.00    0.00       0.214      NA         0.01    0.00       0.171      0.999
rs6702828         Tagging     0.00    0.00       0.012      NA         0.01    0.00       0.027      0.778
rs6702835         Sequencing  0.27    0.25       0.008      0.606      0.42    0.37       0.000      0.493
rs7314            Sequencing  0.27    0.25       0.045      0.666      0.34    0.30       0.078      0.571
rs7367963         Tagging     0.34    0.29       0.029      0.284      0.35    0.31       0.019      0.574
rs7521536         Tagging     0.24    0.20       0.025      0.360      0.32    0.29       0.043      0.320
rs7529377         Tagging     0.24    0.20       0.016      0.331      0.28    0.26       0.027      0.321
rs7540065         Sequencing  0.27    0.25       0.021      0.726      0.37    0.35       0.039      0.997
rs912767          Tagging     0.24    0.20       0.016      0.331      0.29    0.26       0.027      0.240
rs9425722         Sequencing  0.00    0.00       0.342      NA         0.08    0.17       0.335      0.048
rs9425725         Tagging     0.00    0.00       0.012      NA         0.13    0.20       0.027      0.091

ALI. This lack of association may be due to several
causes. First, the sample size limited the effect sizes
we could detect. We genotyped 513
subjects to test for an association between our selected
SNPs and ALI, but this sample size was inadequate to
detect relative risks below 1.93 and 1.69 for alleles
with MAFs of 0.05 and 0.10, respectively. Second, our
analyses were limited to patients with severe trauma.
Thus, our study did not evaluate a possible association
with other causes of ALI such as sepsis. Finally, it is
possible that PRDX6 genetic variation does not modify
the risk of ALI.
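
The dependence of detectable effect size on sample size and allele frequency can be sketched with a normal-approximation power calculation for an allelic case-control comparison. This is not the calculation used in the study: it works with an allelic odds ratio rather than a relative risk, and the 0.05 significance level and the split of subjects into cases and controls are assumptions (the text gives only the total of 513 subjects).

```python
from math import sqrt
from statistics import NormalDist


def allelic_power(n_cases, n_controls, maf_controls, odds_ratio, alpha=0.05):
    """Approximate power of a two-sided two-proportion z-test on allele frequencies."""
    m1, m0 = 2 * n_cases, 2 * n_controls  # two alleles per subject
    p0 = maf_controls
    # Case allele frequency implied by the allelic odds ratio.
    p1 = odds_ratio * p0 / (1 - p0 + odds_ratio * p0)
    pooled = (m1 * p1 + m0 * p0) / (m1 + m0)
    se_null = sqrt(pooled * (1 - pooled) * (1 / m1 + 1 / m0))
    se_alt = sqrt(p1 * (1 - p1) / m1 + p0 * (1 - p0) / m0)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    # Probability of exceeding the critical value in the direction of the effect.
    return NormalDist().cdf((abs(p1 - p0) - z_crit * se_null) / se_alt)


# Hypothetical split of a 513-subject cohort into 150 cases and 363 controls.
for maf in (0.05, 0.10):
    for odds_ratio in (1.5, 2.0, 2.5):
        power = allelic_power(150, 363, maf, odds_ratio)
        print(f"MAF={maf:.2f} OR={odds_ratio:.1f} power={power:.2f}")
```

Under these assumptions power rises with both the effect size and the minor allele frequency, which is why the smallest detectable effect is larger for the rarer alleles.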
Table 8 Multivariate analysis adjusted for age and injury severity score



African Americans European Americans 



SNP 


ORtrend 


Ptrend 


ORdom 


Pdom 


ORrec 


Prec 


ORtrend 


Ptrend 


ORdom 


Pdom 


ORrec 


Prec 


U f~\ /1 f~\AO A A~7 

nLVl94844/ 


1 .33 


0.507 


1 .19 


0.705 


7 07C 1 AO 

3.8/t+oy 


0.999 


1 .16 


0.538 


1 .06 


0.840 


2.-1 1 


0.204 


hLVzDb99l 36 


1 .1 5 


0.842 


1 .1 5 


0.842 


N A 


NA 


Nl A 


NA 


N A 


N A 


NA 


NA 


V^f~\ /7 c r aa 

hLVibbyy 144 


NA 


NA 


NA 


NA 


NA 


NA 


4.1 9E-1 0 


0.999 


4.1 9E-10 


0.999 


NA 


NA 


hLVy04u4zb 


1 .08 


0.725 


1 .08 


0.781 


1.1 3 


0.760 


1 .08 


0.739 


1 .04 


0.885 


1 .32 


0.593 


U/~\ /aa a r\A 7 a 

hLVy 040434 


1 .17 


0.537 


1 .30 


0.442 


1 .04 


0.947 


1 .06 


0.838 


0.94 


0.871 


2.05 


0.365 


rOSItlonz3obb0b4 


1 .25 


0.535 


1 .39 


0.398 


z.u/t-oy 


0.999 


Nl A 


NA 


NA 


N A 


NA 


MA 


n^.-:+-: n ^77orr7A7 


0.56 


0.268 


0.60 


0.374 


1 .52E-09 


0.999 


N A 


NA 


NA 


NA 


NA 


NA 


n^^-i^i^^no n c a 7 c\f 

rOSItlonZjobyjyo 


1 .73 


0.302 


2.17 


0.1 90 


1 .70E-09 


0.999 


Nl A 


NA 


NA 


NA 


NA 


NA 


rositionz38o3z4y 


1 .36 


0.577 


1 .61 


0.431 


1 .70E-09 


0.999 


N A 


NA 


NA 


NA 


NA 


NA 


nnnv/" i717aaaia 

rKUX6_ i / 1 /oyyi 0 


0.66 


0.505 


0.72 


0.636 


1 .74E-09 


0.999 


5.13 


0.1 92 


5.13 


0.1 92 


NA 


NA 


rKDX6_l /I /I I4by 


0.58 


0.088 


0.61 


0.146 


-) ATC !~\C\ 

z.0zt-09 


0.998 


N A 


NA 


NA 


N A 


NA 


NA 


rKDX6_ 1 / 1 /I 38/z 


1 .49 


0.331 


1 .69 


0.229 


1 .06E-09 


0.999 


i .zoHjy 


0.999 


1 AOC A A 

I .z8t-oy 


0.999 


NA 


NA 


nnpiv/ - 1 71 71 rm n 

rKDX6_l /I /IbUiy 


0.93 


0.879 


1 .01 


0.989 


1 .07E-09 


0.999 


1 .77E-09 


0.999 


1 .77E-09 


0.999 


NA 


NA 


rKUX6_ I / I /z4y4y 


1 .35 


0.481 


1 .21 


0.681 


A 1 O IZ i Af7 

4. 1 8t+09 


0.999 


1 .20 


0.455 


1 .1 0 


0.740 


2.50 


0.1 86 


1 r\~ir 7 ao n 

rsl O/biOol 


1 .09 


0.676 


1 .1 9 


0.543 


0.98 


0.970 


1 .28 


0.246 


1 .34 


0.288 


1 .44 


0.439 


rszOOU 


1 .21 


0.793 


1 .21 


0.793 


NA 


NA 


0.98 


0.966 


1 .25 


0.661 


1 7 AC A A 

1 ,30t-09 


0.999 


rs33y4Z0D4 


1 .39 


0.1 38 


1 .78 


0.057 


1 .05 


0.923 


1 .27 


0.321 


1.14 


0.640 


2.96 


0.1 03 


7 ,1 1 7 Ar^: 7 

rs341 zybo3 


1 .73 


0.559 


1 .73 


0.559 


N! A 


NA 


2.16 


0.080 


2.09 


0.1 15 


7 A 7 C 1 A A 

3.4zt+09 


0.999 


rs34ol y/06 


2.13 


0.287 


2. 1 3 


0.287 


NA 


NA 


0.96 


0.900 


0.89 


0.773 


2.32 


0.554 


rs3bz4430o 


1 .25 


0.51 7 


1 .41 


0.349 


1 .34E-09 


0.999 


1 aac t~\r\ 

i .z9t-(jy 


0.999 


1 AAC AA 

i .zyt-uy 


0.999 


NA 


NA 


7 r 7 ^ m A 7 

rs3b/4yz4z 


1 .30 


0.516 


1 A 3 


0.399 


1 Anr AA 

1 .oyt-uy 


0.999 


1 AAC AA 

i .zyt-uy 


0.999 


1 AAC AA 

i .zyt-uy 


0.999 


NA 


NA 


rs43b4b/z 


0.65 


0.251 


0.70 


0.386 


1 .85E-09 


0.999 


Nl A 


NA 


NA 


N A 


NA 


MA 


rs438z/oo 


0.96 


0.830 


0.94 


0.838 


0.94 


0.892 


1 2A 


0.31 2 


1 .26 


0.409 


1 .50 


0.394 


„^ / A1 £L~> £L~\ 

rs49 1 63 oz 


1 .09 


0.670 


1 .1 9 


0.551 


0.99 


0.987 


1 .24 


0.308 


1 .28 


0.381 


1 A A 


0.436 


rsb/03zy35 


1 A 0 


0.1 31 


1 .8 1 


0.053 


1.07 


0.888 


1 .29 


0.296 


1 .16 


0.600 


2.98 


0.1 01 


7 1 1 /11 

rsoo/l 1 41 


1 .37 


0.1 70 


1 .43 


0.256 


1 .64 


0.261 


1 .24 


0.421 


1 .21 


0.549 


1 .86 


0.398 


rr-ff(~\r\ 1 7D 

rsooyyi /y 


1 1 a rr. 1 aa 

1 .i4t+oy 


0.999 


1 1 a r~ i r\r\ 


0.999 


NA 


NA 


N A 


NA 


N A 


NA 


NA 


NA 


■-r-f" 7A7 O 7 O 

rso/Uzozo 


1 .59 


0.778 


1 .59 


0.778 


N! A 


NA 


Nl A 


NA 


N A 


N! A 


Nl A 


NA 


I iO/ UZOJJ 


1 15 


0 493 


1 78 


0 063 


061 


0 242 


1 1 2 


0 606 


1 1 2 


0 686 


1 28 


0 632 


rs7314 


1.13 


0.571 


1.26 


0.437 


1.02 


0.963 


1.10 


0.666 


1.07 


0.820 


1.38 


0.542 


rs7367963 


1.12 


0.574 


1.13 


0.676 


1.26 


0.589 


1.26 


0.284 


1.30 


0.347 


1.45 


0.432 


rs7521536 


1.24 


0.320 


1.47 


0.191 


1.01 


0.977 


1.24 


0.360 


1.11 


0.705 


2.95 


0.104 


rs7529377 


1.25 


0.321 


1.31 


0.360 


1.40 


0.506 


1.26 


0.331 


1.14 


0.651 


2.93 


0.106 


rs7540065 


1.00 


0.997 


1.03 


0.922 


0.95 


0.896 


1.08 


0.726 


1.04 


0.883 


1.35 


0.570 


rs9 12767 


1.30 


0.240 


1.40 


0.250 


1.40 


0.505 


1.26 


0.331 


1.14 


0.651 


2.93 


0.106 


rs9425722 


0.44 


0.048 


0.44 


0.065 


1 .97E-09 


0.999 


NA 


NA 


NA 


NA 


NA 


NA 


rs9425725 


0.60 


0.091 


0.63 


0.154 


2.03E-09 


0.998 


NA 


NA 


NA 


NA 


NA 


NA 



The genotype data were used to construct haplotype
blocks to better assess the PRDX6 gene structure.
Haplotype analysis plays an important role in association
studies between genotype and phenotype, since SNPs
found to be in strong LD can capture most of the
genetic variation across fairly large regions [24]. The
haplotype blocks constructed from our genotype data
did not show strong linkage disequilibrium using
confidence intervals; therefore, tagging SNP strategies in
future studies should be approached with caution.
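
For reference, the pairwise LD statistic r² can be computed from two-SNP haplotype frequencies as D² / (pA(1 - pA) pB(1 - pB)), where D = pAB - pA pB. The sketch below is a generic illustration, not the software used to build the haplotype blocks in this study, and the haplotype frequencies in the example are hypothetical.

```python
def r_squared(hap_freqs):
    """Pairwise r^2 from two-SNP haplotype frequencies.

    hap_freqs maps two-letter haplotypes (allele at SNP 1, allele at SNP 2)
    to their frequencies; both SNPs are assumed to be biallelic.
    """
    total = sum(hap_freqs.values())
    freqs = {hap: f / total for hap, f in hap_freqs.items()}
    first = next(iter(freqs))
    a, b = first[0], first[1]  # reference alleles chosen arbitrarily
    p_a = sum(f for hap, f in freqs.items() if hap[0] == a)
    p_b = sum(f for hap, f in freqs.items() if hap[1] == b)
    d = freqs.get(a + b, 0.0) - p_a * p_b
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined when a SNP is monomorphic")
    return d * d / denom


# Hypothetical haplotype frequencies for two SNPs with alleles A/C and G/T.
example = {"AG": 0.5, "AT": 0.1, "CG": 0.1, "CT": 0.3}
r2 = r_squared(example)
print(f"r^2 = {r2:.3f}")
```

Values near 1 mean one SNP is a good proxy for the other. Low values, as reported here, mean a tagging SNP captures little of its neighbors' variation.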

Our resequencing data did not show any variation in
the coding region of PRDX6. Had nonsynonymous
SNPs been discovered, we would have investigated
whether any of them had an effect on
protein structure that could cause a loss of function
in Prdx6. Since we could not make a connection between
coding region SNPs and conformational changes in the
protein, we examined regulatory effects. We found several
promoter SNPs that change the sequence of potential
TFBSs based on conservation data. We were unable
to confirm that these sequences were in fact TFBSs due
to the lack of available data. However, if any of our
promoter SNPs showed a significant association with ALI
or another phenotype, perhaps in a larger sample,
future studies using promoter constructs could
offer more information on upregulation of PRDX6. We
also found several SNPs in the 3'UTR. It is possible that
one or more of these SNPs is responsible for changing
an miRNA binding site, thus repressing protein
translation.

Figure 2 Haplotype Structure of 17 PRDX6 SNPs in African Americans with r² Values (N = 259)

Our study has several limitations. One potential limitation
of this study is the number of genotype call failures.
Ten and nine markers for African Americans and
European Americans, respectively, were eliminated from
our analysis since they were under the 95% completion
rate cut-off. This high rate of genotype failure was due
to difficulties with consistent assay performance rather
than DNA quality. If these genotypes had been obtained,
it is possible that an association may have been
observed. Also, we did not adjust our results for ancestry
informative markers (AIMs). Instead, our population
was stratified based on skin color, which may not be an
adequate proxy for population admixture effects.
Another possible limitation is a candidate gene approach
that focused on a single gene: PRDX6. ALI risk may be
considered a complex phenotype, and thus is likely not
fully explained by variation in a single gene [10].
Finally, we only tested for association in patients with
ALI from severe trauma. Thus, it is possible that PRDX6
may play a role in the initiation or severity of ALI after
other insults, including sepsis, or in determining recovery
from ALI.
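
The 95% completion-rate cut-off described among the limitations amounts to dropping any marker whose genotype call rate falls below 0.95. A minimal sketch follows; the marker names and missing-genotype frequencies are hypothetical, and in practice the filter would be applied separately within each ancestry group.

```python
def passing_markers(missing_freq, min_call_rate=0.95):
    """Return marker names whose call rate (1 - missing frequency) meets the cut-off."""
    return [name for name, missing in missing_freq.items()
            if 1.0 - missing >= min_call_rate]


# Hypothetical missing-genotype frequencies for four markers.
missing_freq = {"snp_a": 0.016, "snp_b": 0.161, "snp_c": 0.021, "snp_d": 0.399}
kept = passing_markers(missing_freq)
print(kept)
```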

PRDX6 has been shown to play a role not only in ALI,
but in other diseases as well. A recent study demonstrated
that PRDX6 promotes lung cancer metastasis
and invasion via phospholipase A2 activity in mice [32].
Figure 3 Haplotype Structure of 17 PRDX6 SNPs in European Americans with r² Values (N = 254)



Table 9 Haplotype Analysis among African and European Americans 







African Americans                            European Americans
Block    Haplotype  Population   p value     Block    Haplotype  Population   p value
                    Frequencies                                  Frequencies
Block 1  GT         0.654        0.719       Block 1  ACA        0.690        0.520
         AC         0.330        0.833                GTA        0.230        0.479
         GC         0.016        0.563                GTG        0.080        0.999
Block 2  TGTTACGTG  0.282        0.449       Block 2  GGACGTG    0.734        0.883
         TATTGCACA  0.212        0.386                AAGTACA    0.202        0.634
         CATCACATG  0.153        0.083                GAACATG    0.050        0.297
         TACTACGTG  0.099        0.485
         TGTTACATG  0.072        0.455
         TATTACATG  0.055        0.935
         TATTGTACA  0.051        0.894
         TATTACACG  0.036        0.969
         TATCACATG  0.028        0.953
Another publication reported that PRDX6-transfected
breast cancer cells metastasized more readily to the
lungs when compared with control cells [26]. It is possible
that our novel SNPs may function in lung cancer as
well as ALI. The interaction between GSTpi and PRDX6
is another interesting subject for future studies. GSTpi
expression is elevated in tumors from a variety of cancers,
including lung cancer, compared to normal tissue
[33]. Testing gene-gene interactions between PRDX6
and GSTpi would be an interesting future direction
both in ALI and in other diseases such as cancer.

Conclusion 

In conclusion, this study revealed novel SNPs within the
important antioxidant PRDX6 gene and its 5' and 3'
flanking regions via direct sequencing. Several of these
variants have putative function and may be useful for
future gene association studies. Although no association
was found between our novel and tagging
SNPs and trauma-related ALI, future studies may focus
on the role of PRDX6 variation in other at-risk groups, as
well as in other diseases.